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Abstract — In a distributed storage network, reliability and 
bandwidth optimization can be provided by regenerating codes. 
Recently table based regenerating codes viz. DRESS (Distributed 
Replication-based Exact Simple Storage) codes has been proposed 
which also optimizes the disk I/O. Dress codes consists of an outer 
MDS code with an inner fractional repetition (FR) code with 
replication degree p. Several constructions of FR codes based 
on regular graphs, resolvable designs and bipartite graphs are 
known. This paper presents a simple modular construction of 
FR codes. We also generalize the concept of FR codes to weak 
fractional repetition (WFR) codes where each node has different 
number of packets. We present a construction of WFR codes 
based on partial regular graph. Finally we present a simple 
generalized ring construction of both strong and weak fractional 
repetition codes. 

I. Introduction 

Cloud computing has emerged as a fascinating area of 
computing providing various services such as on demand 
videos, music, computing etc. to end users from any computing 
device. Usually data of the user is stored on different data 
centers in a distributed fashion. Many such commercial cloud 
services are available from Amazon, Google and Apple etc. 
EL fZL El- To provide seamless end user experience for 
different applications such as Gmail etc. one has to optimize 
various parameters of the storage system such as reliabil- 
ity, scalability, storage, bandwidth, disk I/O, etc. In such a 
distributed storage system (DSS) data is stored on a set of 
nodes in cloud network, which are not reliable independently. 
Redundancy is added in network to increase the reliability of 
the system. Google File System (GFS) is one such system 
which uses replication of data into various chunk servers (large 
data chunk files) together with replication of logs and master 
index for reliability |4|. Although this scheme is simple but 
inefficient in terms of utilizaion of various storage system 
parameters like bandwidth. One can provide reliability to such 
systems using erasure coding which uses an MDS (Maximum 
Distance Separable) code [5] however these codes fails to 
optimize bandwidth, complexity and disk I/O. To optimize 
these parameters recently regenerating codes were introduced 
and studied in detail by many researchers (6), (7), 0, (9), 

Consider a distributed storage system (DSS) consisting 
of n nodes, where each node can store a symbols from ¥ q 
(finite field with q number of elements). A file size of B 
symbols can be stored in such a DSS either using simply 
a replication (as it is done currently in many cloud storage 
system) of packets or by using an MDS code such that the 



file can be retrieved using erasure codes by contacting any 
k nodes out of those n nodes in network (k < n) Q2J. 
When a node fails, a new code can be generated (known as 
repair of a failed node). In a simple replication system one 
can download it from any other node having a copy of the 
data. This is good bandwidth wise and disk I/O wise but it 
can tolerates only few errors as compare to an erasure code 
system (where a replacement node can contact to k nodes and 
extract the data of that failed node) which can tolerates more 
failures of nodes but it is inefficient for bandwidth and disk 
I/O. To minimize the bandwidth, the concept of regenerating 
code was introduced by Dimakis et al. |6|. In regenerating 
codes instead of downloading whole file B (by contacting all 
k nodes), a node can be repaired by contacting d nodes (d is 
known as the repair degree) and downloading /3 packets from 
each of them out of a packets which are stored on each node. 
This new generated data is functionally equivalent to lost data. 
Thus repair bandwidth becomes d/3. Regenerating codes over 
¥ q are characterized by parameters: {W q , [n, k, d], [a, /?, B]}. 
Bandwidth can be reduced further if each node is willing to 
store more |7 |. Thus there is a tradeoff between storage (a) and 
repair bandwidth (d/3). For internet applications one can search 
for minimum bandwidth repair (MBR) regenerating codes 
and for archival applications one can search for minimum 
storage repair (MSR) codes. These are two extremal points 
on the storage-repair bandwidth tradeoff curve fT3lL 171 . For 
regenerating codes it is well known that 



(1) 



k < d < (n- 1). 



Cut-set bound of network coding gives the following iTTH 
relation between the parameters of regenerating codes. 



k-l 



(2) 



B < ^min{a, (d-i)/3}. 



Clearly we want to minimize both a as well as (j since mini- 
mizing a leads to minimum storage solution and minimizing 
f) leads to a solution which minimizes repair bandwidth. There 
is a trade off between choices of a and f3 as shown in |[T4ll . 
For regenerating codes to be optimal, equality holds in ^ and 
for the MBR (Minimum Bandwidth Regeneration) point, the 
parameter of the code must satisfy ^ (see O). 
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For (3 = 1, the storage capacity (the limit on the maximum 
file size that can be delivered to any user contacting any k out 



of n nodes) in a DSS is given by 031 




Constructions of regenerating codes for /? = 1 achieving 
storage capacity ^ are of special interest. The problem of 
regeneration becomes harder if one wants to have an exact 
repair of the failed node. Rashmi et al. presented an explicit 
construction of exact-Minimum Bandwidth Repair (MBR) 
codes for the parameter set [n,k = n — 2,d = n — 1] by 
fully connected graph [15]. An explicit MBR code is also 
given for all feasible values of the system parameters [n, /c, d] 
and MSR codes for all 2 k — 2 < d < n — 1 based on a 
common product matrix framework [15]. El Rouayheb et al. 
lf]~2l has shown the construction of optimal regenerating codes 
(called Fractional Repetition Codes) by using table-based 
repair model, which is basically a generalization of the codes 
given in fl5l . El Rouayheb et al. constructed the regenerating 
codes for single node failure based on regular graph and 
multiple node failure based on Steiner systems. These codes 
are fast and low-complexity repair codes. It simply reads only 
one of its stored packet and forward it to the replacement 
node with no additional processing. This property is referred as 
uncoded repair. By combining Fractional Repetition code with 
an MDS code a concept of DRESS (Distributed Replication- 
based Exact Simple Storage) codes has been introduced in (8). 
It was an open problem to construct FR codes (and hence 
dress codes) beyond Steiner systems. Several constructions 
of Fractional Repetition Codes (and hence dress codes) are 
known based on bipartite graph [16], resolvable designs [17] 
and regular graphs fl2l . fTSl . In this paper, we present the 
extension of the table based method presented in lfT2l for single 
node failure. We call it weak fractional repetition codes (and 
hence weak dress codes). The construction in this paper gives 
the regenerating code for the values of n and d for which 
regular graph does not exist. We also present some general 
constructions of weak dress codes. 

The rest of the paper is organized as follows. In Section 
2, we summarize popular FR code generation given by El 
Rouayheb et al. lfT2l for single node failure. In Section 3, 
we provide a simple construction of FR code for p = d and 
n = 0. A construction of weak fractional repetition code from 
partial regular graph is given in Section 4. A ring construction 
is given in Section 5 which provides both weak and strong 
fractional repetition codes. Finally Section 6 concludes with 
general remarks. 



satisfying the condition that each element of Q belongs to 
exactly p sets in the collection. 

As MDS code exists for any parameters, the challenging 
part is to get an FR code. In Q2|, FR codes of repetition degree 
2 are designed with an uncoded repair based on regular graph 
which can tolerate at most one node failure. For all feasible 
values of n and d, these codes achieves the capacity Cmbr- 
Either of n and d must be even to get these codes. In fact this 
is the necessary and sufficient condition for constructing codes 
from this method fl2l . 

Proposition 1. The parameter 6 in Def. [7] of an FR code of 
degree p for an (n,k,d) DSS is given by, 

(5) Op = nd. 

Proposition [T] gives a necessary and sufficient condition for 
existence of FR code fT9lL fT2l . The following popular results 
for constructing FR codes are well known from IT2l . l8l . 

Theorem 1. A projective plane of order m gives dress codes 
with n = rn 2 + m + 1 and p = d = m + lby taking points as 
packets and lines as storage nodes. 

Theorem 2. A Steiner system S(2,a,v) gives dress codes with 
parameters p = [^Zi) and n = ^ using lines as storage 
nodes and points as packets. 

In order to understand our work we now describe briefly 
the construction of FR codes based on regular graph fT2l . 

Definition 2. (Regular Graph) : A regular graph is a graph 
where each vertex has the same number of neighbors; i.e. 
every vertex has the same degree. A d-regular graph R Ui d 
of n vertices is the regular graph where all vertices have the 
same degree d. Thus graph R n ,d has ^ vertices. 

Construction 1. To construct a FR code with repetition degree 
p = 2 and with nd even, Algorithm [7] is well known HTH . 



Algorithm 1 FR code construction with p = 2 for {n, d} 
DSS, where nd is even 

1) Generate a d-regular graph R n d on n vertices 
U u U 2: ...,U n . 

2) Index the edges of R n ^ from 1 to 

3) Store on node U{ in the DSS the packets indexed by 
the edges that are adjacent to vertex Ui in the graph. 



II. Previous Work 

El Rouayheb et al. |[T2l presented the table based regener- 
ating codes to reduce complexity. This construction consists 
of containing an outer MDS code with an inner fractional 
repetition code with repetition degree p that can tolerate up 
to p — 1 nodes failing together. In this section, we summarize 
the results of fT2l required for our purpose and also give an 
example of FR code based on regular graph. 

Definition 1. (Fractional Repetition Code) : A Fractional 
Repetition (FR) code C, with repetition degree p, for an 
(n, d) DSS, is a collection C of n subsets U\, L^, • • • , U n 
of a set Vt = {!,...,#}, each having size d, i.e, \Ui\ = d, 



Example 1. Consider a DSS (n = 6, k = 4, d = 5) whose 
storage capacity is 14 according to ([?]). Let ¥ q be the finite 
field of size q. File x = (#1, #2, • • • > #14) £ IFj 4 has 14 
packets, which we want to store on the system. Figure^shows 
a regenerating code which can save the file of 14 packets. 
Regenerating code in Figure [7] is made of two components- 
an outer MDS code (15, 14) and a repetition code based on 
a regular graph shown in Figure [2] When we give file x as a 
input to the outer MDS code, we will get y = (2/1, 2/2? • • • ? Vis) 
as output, where y\, 2/2, • • • Via are same as x±, • . . , #14 
and 7/15 is the parity packet. The coded data packets will be 
saved on 6 different nodes available in the system, according 
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MDS Code 
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Fig. 1. An exact regenerating code for a (6, 4, 5) DSS. The code is formed 
by a (15, 14) parity check MDS code followed by special repetition code. 
This inner repetition code is known as a Fractional Repetition Code of 
repetition degree p = 2. 




Fig. 2. Regular graph for n = 6, d = 5. The labeling of the edges from 1 
to 15 give the FR code with p = 2 for DSS (6,4,5) depicted in Figure [T] 



to inner repetition codes of Figure [7] i.e. node U\ stores 
(2/1,2/6,2/7,2/9,2/13), U 2 stores (2/1,2/2,2/10,2/12,2/14), U 3 stores 
(2/2,2/3,2/7,2/8,2/15), U 4 stores (2/3,2/4,2/10,2/11,2/13), U 5 stores 
(2/4,2/5,2/8,2/9,2/14) and U 6 stores (2/5,2/6,2/11,2/12,2/15). The 
whole file can be retrieved by recovering just 14 distinct 
packets out of these 15, due to the property of outer MDS code. 
As 14 distinct packets can be recovered by contacting to any 
4 nodes out of these 6 nodes so whole file can be retrieved by 
contacting to any 4 nodes. In case of failure of a single node, 
its data can be recovered by contacting exactly 5 nodes and 
downloading 1 packets from each of these nodes. For instance, 
when node U\ fails, it contacts node 112^11^^11^^ U§ and Uq to 
get the packets 2/2,2/7, 2/13, 2/9 an d 2/6 respectively. This example 
is similar to the example in 4721/ . 



III. Modular construction of fractional 

REPETITION CODES FOR p = d AND n = 

In fl2l . El Rouayheb et al. have shown the construction 
of FR codes for repetition degree 3 or more based on Stainer 
system. In this section we give a different approach to get the 
fractional repetition code for p = d and n = 6. 

Proposition 2. Let n and t(> 1) be a positive integers. For 
< j < n - 1, let C? = {t^ 1 + j (mod n + 1)|1 < i < 
p}, where p is a positive integer such that t p ~ 1 < n. Then 
\C™\ = p and C = {C?\0 < j < n - 1} is a FRC over 
Q = {1, 2, . . . , n} with p = d and n = 0. 

Proof: By the construction of it is clear that each such 
set will have size p = d and since = n, the necessary and 
sufficient condition for FRC (equation ([5])) is satisfied. ■ 

A fractional repetition code based on Proposition [2] for n = 
8, t = 2 and p = 3 is given in Figure [3] 
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Fig. 3. Fractional repetition code for n ■ 
Proposition [2] 



: 8, t = 2 and p = 3 based on the 



IV. Construction of Weak Fractional Repetition 
Codes for p = 2 

Definition 3. (Weak Fractional Repetition Code) : A Weak 
Fractional Repetition (WFR) code Cfor a (n, /c, d) DSS, with 
repetition degree p for every packet, is a collection of n 
subsets U\, U2, • • • , U n of a set Q = {1, . . . , 6} such that 
\Ui\ = di, for 1 < i < n, where d = max{<^},<i — d t = 8 t 

n 

is the order of weakness of node Ui and ^2 1^1 = nd — S, 

i=l 

n 

where S = ^2 Si is defined as total order of weakness of C. 

2 = 1 

Remark 1. WFR code with weakness order 5 = gives 
fractional repetition codes of Def. [7] 

Clearly, for WFR Codes, 
(6) p0 = nd- S. 

FR codes for p = 2 (WFR code with weakness order 0) 
has been constructed using regular graph by Ramachandran 



et al. fT2l . Regular graph does not exist for cases where n and 
d both are odd. Motivated by this we define a Partial Regular 
Graph. 

Definition 4. (Partial Regular Graph) : A graph G(n, d) with 
total n vertices such that the degree of some (n — 1) vertices 
is d and one vertex has degree d—1. We call these graphs as 
Partial Regular Graph denoted by PRG(n, d). 

Remark 2. With the abuse of notations, we will use the same 
symbols for graph and it's adjacency matrix. 

A. Properties of Partial Regular Graph 

Let a partial regular graph be denoted by PRG(n, d). The 
rows of its matrix are denoted by Ri and columns are denoted 
by Ci where i G {1, 2, . . . , n} such that degree of every vertex 
is d except one vertex Uk for some k G {1, 2, . . . , n} which 
has degree d — 1. This adjacency matrix of PRG(n, d) has the 
following properties. 

1) PRG(n, d) is a partial regular graph iff both n and 
d are odd. 

2) wt(Ri) = d for all i such that 1 < i < n except for 
i = k and wt(Rk) = d—1. 

3) wt(Ci) = d for all i such that 1 < i < n except for 
i = k and wt(Ck) — d—1. 

So the total degree of the graph becomes even. In fact it is the 
necessary condition for the existence of an undirected graph. 

B. Construction of the Partial Regular Graph from a Circu- 
lant Regular Graph 

Definition 5. (Circulant Graph) : A graph is called as a 
circulant graph C n (d) of degree d if it's adjacency matrix is a 
circulant. i.e. (caj) = (aj-i)for z,j = 0,...,n — 1 (subscripts 
of adjacency matrix are taken modulo n), dij G ¥ q and weight 
of each row is d. 

Circulant matrix is characterized by it's I s * row. If the first 
row of the circulant matrix is (ao, ai, . . . , a n -i) G one 



can form a polynomial q(t) = ao -\- a\t -\- a 2 t 2 



• Cin-lt n 



The algebra of circulant matrix has one-to-one correspondance 
with polynomial algebra in ¥ q [t]/(t n — 1). 

Remark 3. Each circulant graph is a regular graph. 



Let P(tt) 



be the permu- 



1 2 ... n 
tt(1) tt(2) ... 7r(n) 
tation function on n symbols then the corresponding permu- 
tation matrix P n is the n x n matrix whose entries are all 
except that in row i, the entry ir(i) equals 1. We know 
from elementary algebra that to get a permutation matrix P n 
from a permutation P(tt), one can apply permutation function 
P(tt) on the identity matrix I n . Algorithm 2 constructs a 
Partial Regular Graph PRG(n,d) from a circulant graph 
C n (d — 1) whose 1 st row is defined by the polynomial 
q(t) = t + t 2 + . . . + t^- 1 )/ 2 + p-(^-l)/2 + . . . + t n ~\ 
where weight of q(t) = d — 1. 

C. Construction of the WFR Code 



Algorithm 2 Construction of Partial Regular Graph 
PRG(n, d) from Circulant Matrix C n (d - 1) 



1) 

2) 

3) 
4) 



Pick a circulant matrix C n (d — 1) whose first row is 
defined by q(t) = t+t 2 + . . . + ^-i)/2 +t n-(d-i)/2 + 

Get a permutation matrix P n _i whose permutation 
function P(tt) on n — 1 symbols is given by 
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(" + !) 
2 
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("-!) 
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Expand matrix P n _i to make itnxn matrix S n by 
adding at n th row and n t/l column such that S n — 

, where = [0 . . . 0] G Z^" 1 . 



Pn-l 


T 









Add above matrix S n to C n (d — 1) to get the final 
matrix PRG (n , d) . 

Pi?G(n, d) = # n + C n (d - 1) (mod 2). 



graph. Algorithm [2] can be used to construct a a partial 
regular graph which can be used together with Algorithm [3] 
to construct a WFR code. 

Algorithm 3 Construction of the Weak FR code from the 
Partial Regular Graph PPG(n, d) where n and d both are 
odd. 



1) 

2) 
3) 



Generate the above graph PPG(n, d) on n vertices 
U u U 2: ...,U n . 

Index the edges of G n ^ from 1 to nd 2 x . 

Store on node Ui in the DSS the packets indexed by 

the edges that are adjacent to vertex Ui in the graph. 




Fig. 4. Fraction Repetition code for n = 9 and d = 7. Here All vertices 
(except Ug) are have degree 7. Each vertex represents a node and edges 
represents vectors corresponding to the common symbol between the node. 



WFR code from the partial regular graph can be con- 
structed in the same way as in the previous case of regular 



Example 2. To generate the Weak Fractional Repetition Code 
forn = 9 and d = 7, note that q(t) = + t + t 2 +t 3 + 0t 4 + 



ot 5 



t 6 + t 7 + t 8 and hence 
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For the permutation function 

1 2 3 4 5 6 7 



P(tt) = 



5 6 7 



12 3 4 



permutation matrix P% is given by 

/00001000X 

' 1 * 

1 

1 

1 
1 
1 

Voooioooo/ 

Thus by step 3 of Algorithm^ Sq is given by 

/000010000\ 
000001000 
000000100 
000000010 
S 9 = 100000000 
010000000 
001000000 
000100000 

Vooooooooo/ 

Using step 4 of Algorithm^ one gets the following adjacency 
matrix for partial regular graph PRG(9,7) with n = 9 and 
d = 7. 

/ 1 1 1 1 1 1 1 \ 
10 11110 11 
110 11110 1 
1110 11110 
11110 1110 
11110 111 
10 11110 11 
10 11 



PRG(9J) 



1 



110 1 



\ 1 1 1 1 1 1 / 

Adjacency matrix PRG(9 : 7) will give us a graph shown in 
Figure [?] It is a partial regular graph which satisfies the 
properties given in subsection A. This graph will give us the 
Weak Fractional Repetition Code as shown in the Figure [5] 

In this example, suppose we want to store a file x = 
(#i, #2, #3, • • • , £30) £ ¥q° on a DSS (n = 9, k = 7, d = 7). 
There is no regular graph which can give us appropriate frac- 
tional repetition codes. However Figure^shows a partial reg- 
ular graph which gives us weak fractional repetition codes to 
save the file of 30 packets on DSS (n = 9, k = 7, d = 7). Here 
regenerating code is made of two components - an outer MDS 
code (31,30) and a weak fractional repetition code based on 
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Fig. 5. Weak Fractional Repetition code based on generalized ring con- 
sturction for = 31, p = 2 and n = 9. 



partial regular graph shown in Figure [5] #y taking file x as a 
input to the outer MDS code we will get y = (2/1, 2/2, • • • , Z/31) 
as <9^/?w£ w/iere 2/1 = x 1,2/2 = ^2 , • • • , 2/30 = ^30 0^ 2/31 
w the parity packet. The coded packets will be saved on 
9 different nodes in the system according to partial regular 
graph shown in Figure [?] The whole file can be retrieved by 
recovering just 30 distinct packets out of these 31, due to the 
property of outer MDS code. As 30 distinct packets can be 
recovered by contacting to any 7 nodes out of these 9 nodes 
so whole file can be retrieved by contacting to any 7 nodes. In 
case of failure of a single node, its data can be recovered by 
contacting exactly 7 nodes and downloading 1 packets from 
each of these nodes, except in case of node Uq which has to 
contact only 6 nodes in case of failure. For instance, when 
node Ui fails, it contacts node U2,U3,U4,U5,U7,U8,U9 to 
get the packets y u y 10 , 2/23, 2/19, 2/25, 2/i8, 2/9 respectively. Node 
Uq saves only 6 data packets. So we put a default null packet 
as the 7 th packet. To recover this node on failure, user needs 
to contact only 6 other nodes. 

V. Construction of Generalized Ring Code with 

p = 2 

The construction of Section [IV| can be generalized to give 
rise both strong and weak fractional repetition codes. Suppose 
we have packets from set ft — {1, 2, . . . , 6} to be stored on n 
nodes C/i, C/2, • • • , U n such that p = 2. We first place n nodes 
on a circle (See Figure [6] for n = 9 and = 31 packets) and 
then starting from first node start placing the packets between 
successive nodes until all packets are exhausted. For n = 9 
and = 31 this gives the WFR code as shown in Figure [7] 
Here every packet has p = 2. Since 6 = qn + r, < r < n — 1 
this simple construction gives FR code for r = and WFR 
code for r > 0. This simple construction can be modified in 
different ways to obtained a general class of generalized ring 
codes. This will be included in the extended version of the 
paper. I 

VI. Conclusion 

Motivated by the emergence of fractional repetition codes 
which were based on regular graphs, we presented weak frac- 
tional repetition codes which are based on partial regular graph. 
WFR code gives regenerating codes for system parameters 
for which regular graphs does not exist. In this process, we 




Fig. 6. Generalized ring construction for = 31, p = 2 and n = 9. 
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Fig. 7. Weak Fractional Repetition code based on generalized ring con- 
sturction for = 31, p = 2 and n = 9. 

constructed WFR codes of repetition degree 2. As FR code 
exists where nd — even and we presented WFR codes for 
nd = odd, by combining these two, we get regenerating 
codes for single node failure for all feasible values of system 
parameters for any DSS. However WFR codes of higher 
repetition degree can be used for multiple node failure i.e. 
p > 2 as it gives more flexibility to a DSS. In future, weak 
fractional repetition codes could be constructed by Steiner 
system but distribution of default null packets remains an open 
problem. We also presented a simple modular construction 
of fractional repetition codes for p = d and n = and a 
generalized ring construction that gives both strong and weak 
fractional repetition codes. It would be an interesting future 
task to generalize some of these constructions for p > 2. WFR 
codes could be useful for heterogeneous distributed storage 
systems (20). 
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